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Abstract — Opportunistic scheduling and beamforming 
schemes with reduced feedback are proposed for MIMO- 
OFDMA downlink systems. Unlike the conventional beamforming 
schemes in which beamforming is implemented solely by the 
base station (BS) in a per-subcarrier fashion, the proposed 
schemes take advantages of a novel channel decomposition 
technique to perform beamforming jointly by the BS and the 
mobile terminal (MT). The resulting beamforming schemes 
allow the BS to employ only one beamforming matrix (BFM) 
to form beams for all subcarriers while each MT completes the 
beamforming task for each subcarrier locally. Consequently, 
for a MIMO-OFDMA system with Q subcarriers, the proposed 
opportunistic scheduling and beamforming schemes require only 
one BFM index and Q supportable throughputs to be returned 
from each MT to the BS, in contrast to Q BFM indices and Q 
supportable throughputs required by the conventional schemes. 
The advantage of the proposed schemes becomes more evident 
when a further feedback reduction is achieved by grouping 
adjacent subcarriers into exclusive clusters and returning only 
cluster information from each MT. Theoretical analysis and 
computer simulation confirm the effectiveness of the proposed 
reduced-feedback schemes. 

I. Introduction 

Orthogonal frequency-division multiple-access (OFDMA) 
has recently attracted much attention as a promising tech- 
nique for future broadband wireless communications. In an 
OFDMA downlink system, the base station (BS) transmits 
data to several active mobile terminals (MTs) simultaneously 
by modulating each MT's data onto an exclusive set of or- 
thogonal subcarriers. In addition to its robustness to multipath 
fading and high spectral efficiency, OFDMA is particularly 
attractive due to its flexibility in allocating subcarriers to 
different MTs based on their different quality of service (QoS) 
requirements and channel conditions in a dynamic fashion 
[6]. Furthermore, the recent advancement in multiple-input 
multiple-output (MIMO) techniques has inspired considerable 
research interest in MIMO-OFDMA. However, perfect channel 
state information (CSI) is usually required at the BS in order to 
fully harvest the advantages provided by MIMO in the down- 
link transmission, which incurs formidable feedback overhead 
from MTs. This problem becomes particularly challenging 
in MIMO-OFDMA since the required feedback amount is 
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proportional to the number of subcarriers. To circumvent 
this obstacle, an opportunistic scheduling and beamforming 
scheme has been proposed for single-carrier (SC) multiple- 
input single-output (MISO) systems in [5] as an effective 
means of achieving the asymptotic sum-rate capacity by 
exploiting multiuser diversity with limited channel feedback. 
Some extensions of [5] have been developed for SC-MIMO 
systems [1], [3]. In particular, [1] has proposed a singular 
value decomposition (SVD)-based scheme to schedule data 
transmission to the MT whose channel matrix has the right 
singular vectors aligning with beams in the common codebook 
shared by the BS and MTs. Most recently, an opportunis- 
tic scheme has been proposed for single-input single-output 
(SISO)-OFDMA by grouping the adjacent subcarriers into 
exclusive clusters [4]. Assuming the subcarriers in each cluster 
have approximately the same channel conditions, [4] has 
demonstrated good throughput performance by only feeding 
the average cluster SNRs from each MT back to the BS. 

In this work, we first propose a novel beamforming tech- 
nique for MIMO-OFDMA systems. By properly decomposing 
the time-domain channel response matrix into a product of 
subcarrier-dependent components and subcarrier-independent 
components, we can divide the beamforming task into the 
subcarrier-independent part implemented by the BS and the 
subcarrier-dependent part accomplished by each MT locally. 
Next, this novel beamforming technique is proposed to be 
incorporated into the design of opportunistic and beamforming 
schemes, which results in opportunistic and beamforming 
schemes that only require feedback of one beamforming 
matrix (BFM) index and Q supportable throughputs for a 
MIMO-OFDMA system with Q subcarriers. For comparison 
purposes, opportunistic and beamforming schemes employing 
the conventional SVD technique are also proposed by extend- 
ing [1] to MIMO-OFDMA systems. While the SVD-based 
schemes can achieve the optimal throughput performance, they 
incur feedback overhead of Q BFM indices and Q support- 
able throughputs. Analytical and simulation results show that 
the proposed reduced-feedback schemes can asymptotically 
achieve the system throughput obtained by the SVD-based 
schemes. 

Notation : Vectors and matrices are denoted by boldface let- 
ters. ||-|| represents the Euclidean norm of the enclosed vector 
and | • | denotes the amplitude of the enclosed complex-valued 
quantity. Jjy is the N x N identity matrix. We use E {•}, 



(•) T and 



for expectation, transposition and Hermitian 



transposition, respectively. 



Substituting <(5j into we have 



II. Signal Model 

We consider a MIMO-OFDMA downlink system with Q 
subcarriers and K active MTs. The BS and each MT are 
equipped with M and N antennas with M < N, respectively. 
The propagation path between each pair of transmit and 
receive antennas is assumed to undergo independent slow 
frequency-selective fading and has a maximal channel length 
L. We denote by h™' n (p) the k-th MT's channel impulse 
response between the m-th transmit antenna and the n-th 
receive antenna during the p-th OFDMA block. Assuming that 
the cyclic prefix (CP) is sufficient and synchronization has 
been achieved, the received signal by the fc-th MT over its 
q-th subcarrier during the p-th OFDMA block can be written 
as 

Vk,q(P) = G k,q(p) ■ Xk,q(p) + Wk,q(p)i (1) 

where Xk. q (p) is the pre-coded data vector and Wk : q{p) 
is complex circularly symmetric white Gaussian noise with 
zero-mean and unity variance. Furthermore, Gk, q (p) is the 
corresponding frequency-domain channel matrix computed as 



r T 



Gk, g (p) 



hi' 2 (p) 



■ K (p) 



e q h k 

e T „hf\p) 



K (p) 



:h M > N (j>) 



(2) 

with ei being the vector containing the first L elements of the 
z-th column of a Q-point discrete Fourier transform (DFT) 
matrix W whose entry is given by [W] t u — exp (^—^jjp^- 
for < £,u < Q - 1. 

In the following, we concentrate on the fc-th MT over the 
p-th OFDMA block. For presentational clarity, we omit the 
MT and block indices, i.e. k and p, in the sequel. 

III. Proposed Schemes 

A. Per-subcarrier reduced-feedback opportunistic and beam- 
forming scheduling (PS-RF-OS) 

To begin with, we first rewrite G q into the following form. 



G q = [I N <g> e£] 
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where ® denotes the Kronecker product. 
Next, H is singular-value decomposed as 



H = U £ V 



H 



(3) 



(4) 



(5) 



where U and V are unitary matrices of dimension NL x NL 
and M x M, respectively. Furthermore, S is a diagonal matrix 
with min {NL,M} non-zero diagonal elements. 



G a 



[I N ® e T q ] ■ U ■ S ■ V H 



= Q q R q V 



H 



(6) 
(7) 



where the last equality is obtained by the following QR 
decomposition : 



[I N ® e T q ] ■ U ■ S - Q q ■ R q , 



(8) 



with Q q being a unitary matrix of dimension M x M and R q 
an upper triangular matrix of dimension M x N. 

It is important to observe that V is independent of the 
subcarrier index q and subsequently, can be employed as 
the BFM at the BS for all subcarriers. Furthermore, the 
beamforming for each subcarrier is characterized by Q q which 
is known to the MT. Finally, in contrast to the conventional 
SVD-based channel decomposition where the channel matrix 
is decomposed as a product of two unitary matrices and a 
diagonal matrix, the diagonal matrix is replaced by an upper 
triangular matrix R q in @. By exploiting these characteristics 
of ©, we propose the following reduced-feedback opportunis- 
tic scheme for MIMO-OFDMA. 

Assuming perfect channel estimation is achieved by each 
MT, we find a unitary BFM V satisfying V H Vd = Im from 
a common codebook of 2 B entries shared by the BS and all 
MTs. We define 



clef 



V H V d 



It is easy to show that < "f m ,m < 1 f° r m = 1,2, 
The following criterion is employed to select Vd- 



argmax 



M 
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(9) 



(10) 



Obviously, it is more likely to achieve V H Vd* = Im if the 
codebook contains more candidate BFMs at the cost of more 
feedback bits. 

The integer d* is then returned to the BS and Vd* is 
employed to pre-coded the data symbols as 



Xn = V d * 



(ID 



where s q is the data vector of length M and _E |||s f; || 2 | = 
pM. 

Assuming V H Vd* = Im and recalling ©, we pre-multiply 
on both sides of ([1} and have 



- Q H q 

-R/7 ' Sq 



y- 



(12) 
(13) 



where w' q — Q^w q . 

Since R q is an upper triangular matrix, the vertical Bell 
Laboratories Layered Space-Time (V-BLAST) receiver [2] can 
be employed to detect s q {m) for m = M, M — 1, • • • , 1 
sequentially, where s q {m) is the m-th entry of s q . It has been 
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Fig. 1. Schematic diagram of the proposed per-subcarrier reduced-feedback opportunistic scheduling and beamforming scheme (PS-RF-OS). 



shown in [?] that the adverse effect of error propagation in V- 
BLAST receivers is negligible for the high SNR region. As a 
result, ( fT3] l can be rewritten as M equivalent parallel channels: 



1,2,-. -,M, (14) 



where \R a , ... ,„ ,« u,^>nu„ ».< .«...„. 

supportable data throughput on the q-th subcarrier is given by 



is the m-th diagonal element of R„. Thus, the 
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However, only V H V rj Jm holds for practical systems 
with a finite codebook. Thus, the actual supportable throughput 
can be approximated as 



M 



C a 



l0 S2 1 + P 



m— 1 
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(16) 



Finally, each MT returns to the BS one integer BFM 
index d* as well as Q real-valued supportable throughputs 
{C q ;q — 1,2, ■■■ ,Q}. After receiving information from all 
MTs, the BS assigns each subcarrier to the MT with the 
highest supportable data throughput. In the sequel, this scheme 
is referred to as the per-subcarrier reduced-feedback oppor- 
tunistic scheduling and beamforming scheme (PS-RF-OS). PS- 
RF-OS is depicted in Fig. Q] and summarized as follows. 
1) In the beginning of a time slot, each MT estimates its 
time-domain channel response matrix H by exploiting 
a pilot signal broadcast from the BS; 
Each MT computes Q q , R q and V according to ©; 
The best BFM Vd* is selected from the common code- 
book based on the selection criterion (TTOt ; 

4) Finally, each MT feeds d* and {C q } back to the BS; 

5) Taking fairness in account, the BS assigns subcarriers to 
each MT based on {Cq}; 

6) The BS transmits data pre-coded with Vd- throughout 
the current slot; 

7) The above procedures are repeated in the next slot. 



2) 
3) 



B. Per-cluster reduced-feedback opportunistic and beamform- 
ing scheduling (PC-RF-OS) 

Inspired by [4], a further feedback reduction can be achieved 
by dividing the Q subcarriers into G exclusive clusters com- 
posed of U adjacent subcarriers with Q = G x U. Assuming 
that the channel conditions are approximately constant over 
the U subcarriers in the same cluster, the following average 
supportable data throughput of each cluster is fed back to the 

(17) 



C 3 ~ J; 53 C 1 

qelg 



where T g = {i g> i, i g .2, ■ ■ ■ , ig,u} is the subcarrier index set of 
the g-th cluster. 

As a result, only one integer-valued d* and G real-valued 
supportable throughputs are required to be fed back to the 
BS. In the sequel, this scheme is referred to as the per-cluster 
reduced-feedback opportunistic scheduling and beamforming 
scheme (PC-RF-OS). 

C. Per-subcarrier eigen-beamforming opportunistic schedul- 
ing (PS-EB-OS) 

For comparison purposes, two eigen-beamforming schedul- 
ing schemes are proposed in the following. These two schemes 
can be considered as a MIMO-OFDMA extension of the SC- 
MIMO scheme proposed in [1] and require the conventional 
amount of feedback. For each subcarrier, its channel matrix 
G q is decomposed as 



G 



U q -X q - V q 



H 



(18) 



where U q and V q are unitary matrices of dimension NxN and 
M x M, respectively. Furthermore, S g is a diagonal matrix 
whose first M diagonal elements are the singular values of G q , 
X q _i for i — 1, 2, • • • , M. In contrast to ©, V q is dependent 
on the subcarrier index q. 

Thus, the transmitted signal can be pre-coded with a BFM 
Vq chosen from the common codebook in an approach similar 



to ( fTOb . Assuming V H V q = I, we pre-multiply the received 
signal in (Q~|) with U q and obtain 



K = U„ H y = S 9 s 9 + w 



(19) 



where w' q ' = U^w q . 

Finally, we can have the resulting supportable data through- 
put as 

M 



f q =Y^ log 2 (l + 9 



q , m | 



m— 1 



(20) 



Jm for a finite codebook, the actual 
supportable throughput can be approximated by 



Since V q V q 



M 

T V~J2 l0 S2 ( 1 + P \ X 1,mf 7m,r, 



m— 1 



(21) 



where is evaluated as shown in <j9j with V being 

replaced by V q . 

It is clear that this approach requires each MT to feed 
back Q BFM indices as well as Q real-valued supportable 
data throughputs, which amounts to approximately twice the 
feedback amount of PS-RF-OS. In the sequel, this scheme 
is referred to as the per-subcarrier eigen-beamforming oppor- 
tunistic scheduling and beamforming scheme (PS-EB-OS). 

D. Per-cluster eigen-beamforming opportunistic scheduling 
(PC-EB-OS) 

If subcarriers are grouped into clusters, then feedback can 
be reduced to only one BFM index and the average supportable 
throughput for each cluster. This results in a feedback reduc- 
tion by a factor of U. In the sequel, this scheme is referred to 
as the per-cluster eigen-beamforming opportunistic scheduling 
and beamforming scheme (PC-EB-OS). 

E. Remarks 

The following comparison about the four proposed schemes 
is of interest. 

1. ) PS-EB-OS/PC-EB-OS requires twice the amount of feed- 
back of PS-RF-OS/PC-RF-OS, respectively; 

2. ) In PC-EB-OS, each MT feeds one BFM index per cluster 
back to the BS. Consequently, the chosen BFM may be a poor 
approximation of the true eigen-BFM for some subcarriers in 
the cluster, which incurs severe performance degradation as 
the cluster size increases. On the contrary, PC-RF-OS entails 
only marginal performance degradation due to clustering since 
the feedback reduction involves no BFM information loss. 

3. ) It is fair to say that the substantial feedback reduction of 
PS-RF-OS/PC-RF-OS is achieved at the cost of higher com- 
putational complexity at the MTs compared to PS-EB-OS/PC- 
EB-OS since the V-BLAST receivers have to be employed to 
detect the data symbols. 



IV. Asymptotic Performance Analysis 

In this section, we compare the asymptotic data throughput 
of PS-RF-OS in |Q3} and that of PS-EB-OS in ©. Observ- 
ing that (|7]i and ( fl8T l stand for two different decomposition 
expressions of the same matrix G q , we have 



Q q ■ R q ■ V H = U q ■ S, • V q 



H 



(22) 



On taking the determinant of both sides and recalling that 
Q q , V, U q and V q are all unitary matrices, we can easily 
obtain 



det (R q ) = det (S 9 ) , 



or equivalently, 



M 



A I 



n k 



Assuming p|A g , m | > 1 and p [R q } mm 
obtain the following asymptotic relationship between the two 
data throughputs as p increases to infinity. 



(23) 
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3> 1, we can 
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Mlog 2 (p) + 21og 2 J] \[R q l 

/ M 

Mlog 2 (p) + 21og 2 }[ | A,,, 



= 1. 



(25) 



Thus, the data throughput of PS-RF-OS approaches to that of 
PS-EB-OS asymptotically as p increases. 

V. Simulation Results 

Computer simulation are conducted in this section to as- 
sess the performance of the four proposed scheduling and 
beamforming schemes. The simulated OFDMA system has 
Q — 128 subcarriers and M — N — 2 antennas. The 
channel response of each MT is generated according to the 
HIPERLAN/2 channel model with eight paths (L = 8). In 
particular, the channel coefficients are modeled as independent 
and complex-valued Gaussian random variables with zero- 
mean and an exponential power delay profile 



^{iMOl 2 } = A.cxp{-/}, I = 0, 1, • • ■ , 7. 



(26) 



where the constant A is such chosen that £'||h,fe| 2 | = 1. 
To prevent inter-block interference (IBI), a CP of length 
N g = 8 is employed. Notice that we have implicitly assumed 



a homogenous network in (1261 1. As a result, allocating the 
subcarrier/cluster to the MT with the highest throughput is 
a fair scheduling strategy. Unless otherwise specified, we set 
SNR = 10 dB, K = 10, B = 8 for all schemes and G = 8 
for PC-RF-OS and PC-EB-OS. 





— B — PS-RF-OS (B™) 
-A- PS-RF-OS (B=8) 
-*— PC-RF-OS (B=8, G=8) 

- - PS-EB-OS (Ba«) 

- - PS-EB-OS (B=8) 

- + - PC-EB-OS (B=8, G=8) 



Number of codebook bits, 1 



Number of MTs. K 



Fig. 2. Average throughput as a function of the number of bits in the 
codebook, B. 



Fig. 3. Average throughput as a function of the number of MTs, K. 



Case 1: Impact of the codebook size 

In this experiment, we investigate the impact of the code- 
book size on the system throughput. Figure [2] shows the 
average throughput as a function of B. The curves labeled 
"B = oo" serve as the performance upper bound with 
V H V d , = I M or V q H V d * = I M . Figure |2] indicates that 
the throughput of all schemes improves as the codebook size 
increases. In particular, Fig. [2] suggests that a codebook with 
B = 8 bits is sufficiently large to allow PS-RF-OS, PS- 
EB-OS and PC-RF-OS to achieve throughput performance 
approaching that achieved with B = oo. In other words, B = 8 

</* 



is sufficient to achieve V H Vd* = Im or V^Va* = Im 



Case 2: Throughput comparison 

In the second experiment, we compare the performance 
of the four scheduling schemes in terms of throughput as a 
function of K, the number of MTs. It is clear from Fig. [5] 
that the throughput performance of PS-RF-OS and PS-EB-OS 
with "B = oo" is identical, which confirms our asymptotic 
analysis in Sec. [IV] Furthermore, the performance difference 
among PS-RF-OS, PS-EB-OS and PC-RF-OS with B = 8 is 
marginal whereas the throughput of PC-EB-OS suffers from 
the reduced feedback. 

Case 3: Impact of the cluster size 

In this last experiment, we study the impact of the cluster 
size on the system throughput of PC-RF-OS and PC-EB- 
OS. Figure @] shows that PC-RF-OS entails less performance 
degradation due to adopting a larger cluster size, i.e. a smaller 
G, compared to PC-EB-OS. This is because PC-RF-OS only 
reduces the feedback information on the supportable through- 
put of each subcarrier whereas PC-EB-OS further suffers from 
the information loss about the optimal BFM. 

VI. Conclusion 

Two opportunistic scheduling and beamforming schemes, 
namely PS-RF-OS and PC-RF-OS, have been proposed for 
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Fig. 4. Average throughput as a function of the number of clusters, G. 



MIMO-OFDMA downlink systems by exploiting a novel 
beamforming technique where beamforming is performed 
jointly by both the BS and MTs. The resulting PS-RF-OS 
requires feedback of only one BFM index and Q supportable 
throughputs for a MIMO-OFDMA system with Q subcar- 
riers. A further feedback reduction is achieved by PC-RF- 
OS through grouping adjacent subcarriers into clusters and 
returning only cluster information from each MT. Since PC- 
RF-OS does not incur feedback loss regarding the BFM infor- 
mation, it only entails marginal performance degradation with 
respect to PS-RF-OS. For comparison purposes, two SVD- 
based opportunistic scheduling and beamforming schemes, 
namely PS-EB-OS and PC-EB-OS, have been developed. The 
superior performance of PS-EB-OS is obtained at the cost of 
twice much amount of feedback compared to PS-RF-OS and 
PC-RF-OS whereas the performance of PC-EB-OS decreases 
rapidly as the cluster size increases. It has been demonstrated 
through asymptotic analysis and computer simulation that PS- 
RF-OS and PC-RF-OS can achieve throughput performance 
comparable to PS-EB-OS with substantially reduced feedback. 
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